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Huntington disease is an autosomal-dominant neurodegenerative disorder caused by a polyglutamine (polyQ) 
expansion (> 35Q) in the first exon (EX1) of huntingtin protein (Htt). mHtt protein is thought to adopt one or more toxic 
conformation(s) that are involved in pathogenic interactions in cells. However, the structure of mHtt is not known. Here, 
we present a near atomic resolution structure of mHtt36Q-EX1. To facilitate crystallization, three histidine residues (3H) 
were introduced within the Htt36Q stretch resulting in the sequence of Q 7 HQHQHQ 27 . The Htt36Q3H region adopts 
a-helix, loop, (3-hairpin conformations. Furthermore, we observed interactions between the backbone of the Htt36Q3H 
(3-strand with the aromatic residues mimicking putative-toxic interactions with other proteins. Our findings support 
previous predictions that the expanded mHtt-polyQ region adopts a (3-sheet structure. Detailed structural information 
about mHtt improves our understanding of the pathogenic mechanisms in HDand other polyQ expansion disorders and 
may form the basis for rational design of small molecules that target toxic conformations of disease-causing proteins. 



Introduction 

Huntington disease (HD) is an autosomal-dominant neurode- 
generative disorder. Neuropathological analysis of HD patients 
reveals selective and progressive neuronal loss in the striatum, 1 par- 
ticularly affecting the GABAergic medium spiny striatal neurons 
(MSN). At the molecular level, the cause of HD is a polyglutamine 
(polyQ) expansion (> 35Q) near the N-terminus of Huntingtin 
(Htt), a 350 kDa ubiquitously expressed cytoplasmic protein. 2 
The cellular mechanisms that link Htt polyQ expansion (mHtt) 
with the disease are under intense investigation. PolyQ-expanded 
mHtt is thought to adopt one or more toxic conformation (s) that 
are involved in pathogenic protein-protein interactions in cells. 3,4 
A number of toxic functions have been assigned to mHtt, includ- 
ing formation of toxic aggregates, effect on gene transcription, 
induction of apoptosis and disruption of key neuronal functions 
such as proteosomal function, ubiquitination, axonal transport, 
endocytosis, synaptic transmission and Ca 2+ signaling. 3 ' 10 Many 
of the proposed mechanisms suggest that the mHtt is involved 
in pathological interactions with other signaling proteins in cells, 
leading to neuronal dysfunction and death. 
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Information about the structure of Htt polyQ region is critical 
for understanding mHtt toxicity and may aid in the development 
of potential HD therapies. However, the structure determination 
of the polyQ region of Htt has proven to be an extremely difficult 
problem. 10,11 In the aggregated form polyQ sequence most likely 
adopts (5-sheet structure. 12 " 21 The biophysical studies of soluble 
monomeric form of polyQ fragments of various length provided 
evidence for random coil conformation, 15,22 " 24 for a-helical confor- 
mation, 25,26 for (3-sheet 19,26 and for extended helix. 27 We previously 
solved the crystal structure of the first exon (EX1) of Htt-17Q 28 
and the other group reported crystal structure of GQ 0 G peptide 
complexed with the polyQ-specific antibody. 29 However, the near 
atomic structure of mHtt with a disease-associated polyQ expan- 
sion is not known. Here, we present a near atomic resolution struc- 
ture of exon 1 of Htt-36Q with insertion of 3 histidine residues 
within 36Q stretch resulting in the sequence Q 7 HQHQHQ 27 . 
We discovered that Htt-36Q3H-EXl protein fragment adopts 
a-helix, loop and |3-hairpin conformations. Our findings support 
previous predictions that the expanded polyQ region adopts a 
(3-sheet structure and provide first near atomic resolution struc- 
ture of pathogenic form of mHtt or any other polyQ-expanded 
protein. 
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MBP3A-Httl7Q-Exl 
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MBP3A-Htt36Q3H-Exl 



M371 F387 P427 0438 Q452 

MBP-AAA^4ATLEKLMKAFESLKSFyWWOQHQaQHQQOQOQQQ<10^QQQOOQOOQQQQQOOQPPPPPPPPPPPQLPQPPPQAQPLLPQQSYOJ^AGXLG^GRRf'T^S 




Figure 1. The structure of MBP-HU36Q3H-EX1. (A) Amino acid sequence of MBP-HU17Q-EX1 28 and MBP-Htt36Q3H-EX1 expression constructs. MBP3A 
denotes the maltose binding protein followed by a three alanines linker. In both constructs position M371 corresponds to M1 in Htt sequence. The 
sequence of Htt-Ex1 is composed of N-terminal N17 region (green), polyQ region (orange), PolyP region (blue) and mixed P/Q region (purple). Three 
His residues (pink) are inserted within poly36Q stretch in MBP-Htt36Q3H-EX1 construct. In both constructs identical 19 amino acids C-terminal tag 
was added to facilitate crystallization (black). (B) The trimer of MBP-HU36Q3H-EX1 . (C and D) Htt36Q3H-EX1 trimers in the crystal XI (C) and X2 (D). On 
panels (B-D) the MBP protein (gray), 3A linker (gray), Htt-N17 (green) and Htt-polyQ (orange) regions are shown for each of the three MBP-Htt36Q3H- 
EX1 molecules in the asymmetric unit of the crystal. 



Results 

Crystallization of Htt36Q exon 1 fragment. In the previous 
study we used protein crystallography approach to determine near 
atomic resolution structure of exon 1 of Httl7Q (Httl7Q-EXl). 28 
In this study Httl7Q-EXl protein fragment was expressed, puri- 
fied and crystallized as N-terminal MBP-fusion protein with addi- 
tion of 19 amino-acids C-terminal tag 28 (Fig. 1A). In the present 
study we aimed to use the same approach to solve the structure of 
mHtt exon 1 containing expanded glutamine repeat region. By 
following procedures described in the previous study 28 we gener- 
ated, expressed, purified and crystallized MBP-Htt36Q-EXl pro- 
tein with C-terminal tag. However, resulting crystals diffracted 
at resolution less than 6 A, preventing structure determination of 
Htt36Q-EXl protein (data not shown). We reasoned that poor 
diffraction quality of MBP-Htt36Q-EXl crystals is due to a ran- 
dom helix-to-loop transition within 36Q region, as observed in 
our previous structural studies of Httl7Q-EXl. 28 To reduce the 
randomness within polyQ structure, we inserted XQXQX inter- 
rupter sequence within the Htt-polyQ region in order to either 
break or stabilize the ct-helical conformation of Htt-polyQ. As an 
X amino acid we selected potential helix breaker Gly and potential 
helix stabilizer Ala. In addition, we also tested turn-promoting 
His. We selected His because insertion of His within a polyQ 
stretch in Ataxin-1 exerts strong disease-modifying effect in 



SCA1 patients, 30 " 32 and a number of detailed biophysical studies 
have been performed with polyQ peptides containing His inser- 
tions. 33 " 35 The XQXQX interrupters (X = G, A, H) were intro- 
duced at several locations within Htt-polyQ-EXl constructs. The 
polyQ constructs of various length (17Q, 23Q, 36Q and 45Q) 
were generated in our experiments. From all constructs that we 
tested, only MBP-Httl7Q3A-EXl (first X at position 8) and 
MBP-Htt36Q3H-EXl (first X at position 8) yielded diffrac- 
tion quality crystals (Table SI). The Httl7Q3A protein adopted 
a-helical conformation in the crystals (data not shown), in agree- 
ment with our previous studies of Httl7Q-EXL 28 Thus, in order 
to explore the Htt-polyQ conformational space further we focused 
on Htt36Q3H-EXl construct for our further studies. 

The overall design of Htt36Q3H-Exl expression construct 
(Fig. 1A) was similar to the design of the Httl7Q-EXl expres- 
sion construct used in the previous study. 28 The N-terminal MBP 
molecule was connected with Htt36Q3H-Exl via 3 alanines 
(3A) linker (Fig. 1A). The amino-acid (aa) numbering in MBP- 
Htt36Q3H-Exl construct starts with MBP, so that Metl in the 
Htt corresponds to Met371 in the MBP-Htt36Q3H-Exl. The 
N-terminal region of Htt36Q3H-Exl starting at Met371 (Metl 
in Htt sequence) extends for 17aa until Phe387 (Fig. 1A). The next 
region Gln388-Gln426 contains a stretch of 36Gln (poly36Q) 
with addition of 3 Histidines and immediately followed by a 
stretch of 11 Pro from Pro427 until Pro437 (polyllP) (Fig. 1A). 
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The proline stretch is followed by 15aa mixed Pro and Gin regions 
(polyP/Q) that spans from Gln438 until Gln452 (Fig. 1A). The 
native Htt sequence ends at Gln452 and followed by an exogenous 
C-terminal 19 aa tag added to facilitate crystallization (Fig. 1A). 

MBP-Htt36Q3H-EXl protein was expressed, purified and 
crystallized according to the procedures described in our previous 
paper 28 (see Experimental Procedures for details) . X-ray diffraction 
data were collected for several crystals of MBP-Htt36Q3H-EXl, 
with the best crystals diffracting up to 2.5 A resolution. All crys- 
tals were monoclinic, space group C2. The asymmetric unit 
contained 3 molecules of MBP-Htt36Q-EXl (designated as mol- 
ecules A, B, C) in all crystals. The phasing was done for the two 
best crystals (XI and X2) by molecular replacement (MR) using 
MBP structural models (see Experimental Procedures for details). 
MR gave solutions for both crystals with complete maps for MBP 
and partial maps for Htt36Q3H-EXl. The detailed procedures 
used to generate the maps are described in Methods section. 

The crystallographic parameters for XI and X2 crystals are 
listed in Table 1. In the process of refinement we discovered 
that molecule C adopts dual conformation in both crystals. The 
model for each of these conformations (CI and C2) was built and 
refined independently. The final refinement statistics for MBP3A- 
Htt36Q3H-Exl structure determination in XI andX2 crystals are 
summarized in Table 1. The packing of 3 MBP-Htt36Q3H-Exl 
molecules in the asymmetric unit (Fig. IB) was nearly identical 
to the trimer formed by 3 MBP-Httl7Q-Exl molecules in the 
previous study. 28 

Secondary structure of the Htt36Q3H-EXl. The ribbon 
structure diagrams of Htt36Q3H-Exl trimers resolved in crystals 
XI and X2 are shown on Figure 1C and D. All 6 molecules com- 
posed of identical a -helical N17 N-terminal Htt region (green) and 
polyQ regions (orange) which adopted multiple conformations in 
the crystals. Similar to the previous structure of Httl7Q-Exl, 28 
the trimer packing of Htt36Q3H-Exl molecules is stabilized by 
hydrophobic interactions between Leu and Phe residues from Htt- 
N17 regions of molecules A, B and C. The polyQ region initially 
adopts a-helical structure, which extends for 7 Gin until Gln394 
residue in all 6 molecules (Fig. 1C and D). The Htt36Q3H struc- 
tures starting from Gln396 adopt diverse conformations as sum- 
marized in Table 2. The A and B molecules in both crystals adopt 
(3-hairpin conformations starting from Gln396 (Table 2). The 
C molecules in both crystals adopt dual conformations (CI and 
C2), which have been modeled independently. The CI conforma- 
tions in crystal XI and X2 are an a-helix and a loop respectively, 
while C2 conformation in both crystals is the hairpin (Table 2). 

Multiple conformations of polyglutamine region in 
Htt36Q3H-EXl. The ribbon-stick structures of C molecules in 
a-helical and loop conformations are shown on Figure 2A and B 
together with the supporting electron density difference maps 
(2Fo-Fc) . The a-helical conformation of polyQ region of CI mol- 
ecule in crystal XI extends until Gln4l0 according to the elec- 
tron density map (Fig. 2A; Table 2). In contrast, for CI molecule 
in crystal X2 the ct-helix breaks from H395 and becomes a loop 
starting at Q396 (Fig. 2B; Table 2). The loop can be resolved until 
Q407 according to the electron density map (Fig. 2B). The polyQ 
regions in a-helix and loop conformations have been observed in 



Table 1. Diffraction data collection and refinement statistics for two 
crystals used for MBP-HU36Q3H-EX1 structure determination 



Crystal 


XI 


X2 


Data collection 


Wavelength (X) 


0.98 


0.98 


Space group 


C2 


C2 


Unit cell parameter (A, °) 


a 


155.052 


153.182 


b 


177.278 


177.357 


c 


78.868 


78.346 


P 


109.025 


108.674 


Resolution limit (A) 


2.8-40.0/2.8-2.9 


2.8-40.0/2.8-2.9 


R (%) 


11.0/53.2 


6.6/65.3 


l/crl (overall/outer 
shell) 


27.9/2.1 


17.2/1.0 


Redundancy (overall/ 
outer shell) 


3.6/3.1 


3.6/2.4 


Overall completeness 

(%) 


99.2 


99.2 


Refinement 


XI -C1 XI - C2 


X2-C1 X2-C2 


B-factor 


Overall (A 2 ) 


54.3 55.8 


56.2 56.1 


Solvent 


46.9 47.7 


27.3 30.8 


No of data 


40324 


39566 


^Worl/^Free 


0.23/0.27 0.23/0.27 


0.23/0.27 0.23/0.27 


RMS_D (A) 


0.006 0.006 


0.005 0.005 


RMS_A (°) 


0.912 0.912 


0.789 0.789 


Ramachandran map 


Favored (%) 


97.2 97.4 


97.1 96.9 


Allowed (%) 


2.6 2.5 


2.7 2.7 


Outliner (%) 


0.2 0.1 


0.2 0.4 


PDB accession 
number 


4FE8 4FEB 


4FEC 4FED 



Table 2. Conformations adopted by the Htt36Q3H region starting at 
position Q396 in MBP-Htt36Q3H-EX1 for crystals XI and X2 



Crystal 


Molecule 


Residues 


Conformation 




A 


396Q-401Q 


Hairpin 


XI 


B 


396Q-401Q 


Hairpin 


CI 

C 

C2 


396Q-410Q 


a-helix 




396Q-401Q 


Hairpin 




A 


396Q-401Q 


Hairpin 


X2 


B 


396Q-401Q 


Hairpin 


CI 

C 

C2 


396Q-407Q 


Loop 




396Q-401Q 


Hairpin 
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Figure 2. The Htt36Q3H region in a-helical, loop and (3-hairpin conformations. (A) The a-helical structure of Htt36Q3H (Q388-Q410) (CI molecule 
from crystal XI). (B) The a-helical structure of Htt36Q3H (Q388-H395) followed by a loop (Q396-Q407) (CI molecule from crystal X2). (C) The a-helical 
structure of Htt36Q3H (Q388-Q394) followed by a (3-hairpin (H395-Q401) (C2 molecule from crystal XI). On (A-C) Htt-N17 (green), Htt-polyQ (orange) 
and His residues (pink) are labeled and numbered on the ribbon diagram. The supportive electron density maps (2Fo-Fc) at 1cr are shown by a green 
mesh. 




our previous analysis of Httl7Q-Exl structure. 28 Both a-helical 
and loop conformations of Htt36Q3H-Exl (Fig. 2A and B) and 
Httl7Q-Exl 28 are located in the solvent area and lack any obvious 
protein interactions. 



In addition to od-helix and loop conformations, we discovered 
a 3-hairpin conformation of Htt36Q3H region. The example 
of such conformation with the supporting electron density map 
(2Fo-Fc) is shown on Figure 2C. These structural elements have 
not been observed in our previous analysis of Httl7Q-Exl, 28 
but they have been discovered for Q396-Q401 region of most 
Htt36Q3H-Exl molecules (Table 2). The example of B-hairpin 
structure formed by Q396, H397, Q398, H399, Q400, Q401 is 
shown in details on Figure 3. The B-hairpin consists of 3-strand 
(Q396 and H397), turn (Q398 and H399) and B-strand (Q400 
and Q401) (Fig. 3, diagram) . This hairpin is stabilized by intramo- 
lecular double hydrogen bonds between H397 and Q400 (Fig. 3). 
We propose that the transition of Htt-polyQ conformation from 
a-helix (Fig. 2A) to loop (Fig. 2B) to P-hairpin (Fig. 2C) cap- 
tured in our crystals provides a plausible sequential conforma- 
tional transition from a-helical to B-sheet conformation in mHtt. 
These transitions have not been previously observed for Htt or for 
any other protein using near atomic resolution techniques. 

Intermolecular protein interactions of polyglutamine region 
in Htt36Q3H-EXl. In contrast to "inert" a-helical and loop 
conformations of Htt36Q3H-Exl located in the solvent region 
(Fig. 2A and B), the B-hairpin of Htt36Q3H-Exl was 
resolved fastened to the aromatic residues in the MBP protein 
(Figs. 2C and 3). The hairpin in these crystals was stabilized by 
intermolecular interactions between the amine of amide back- 
bone (HN b ) of Q398, H399, Q400 and Q401 in Htt and aro- 
matic rings of Y155, Y210 and W230 in MBP (Fig. 4). The amide 



Figure 3. The Htt36Q3H region in (3-hairpin conformation. The hairpin 
structure of Htt36Q3H (H395-Q401) (C1 molecule from crystal XI). The 
Gin (orange) and His (pink) residues are labeled and numbered on the 
stick diagram. The hydrogen bonds are shown by a dotted line. The 
Y210 and W230 proximal residues from MBP are also shown. The sup- 
portive electron density maps (2Fo-Fc) at 1<j are shown by green mesh 
for Htt36Q3H. The diagram of structure is depicted in a schematic form 
on the right panel. 
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Figure 4. Intermolecular interactions between (3-hairpin of Htt36Q3H and aromatic residues in MB P. (A) The interaction between the amide backbone 
of Q398 in Htt36Q3H and Y155 in MBP. (B) The interactions between the amide backbones of H399 and Q400 in Htt36Q3H and W230 in MBP. (C) The 
interactions between the amide backbone of Q401 in Htt36Q3H and Y210 in MBP. On panels (A-C) the interactions are shown for C1 molecule in 
crystal X1 at different angles. The Gin (orange) and His (pink) residues are labeled on the stick diagram. The interacting residues are numbered and the 
interactions are shown with a dotted arrow line with interaction distances indicated in angstroms (A). The interaction distances were measured from 
the backbone of the residues in the Htt36Q3H p-strand to the center of the aromatic ring residues in MBP. The supportive electron density maps 
(2Fo-Fc) at 1cr are shown by a green mesh for Htt38Q3H and a white mesh for MBP. 



backbone (HN b ) of each residue interacts with aromatic rings in a 
parallel configuration (Fig. 4). In this geometry HN b is capable to 
form two simultaneous hydrogen bonds, intramolecular with the 
carboxyl on the other side of the Htt hairpin and intermolecular 
with the aromatic ring provided by the MBP amino acids. This 
type of interactions between aromatic rings and amide backbone 
(Ar-HN b ) has been observed in many protein structures previ- 
ously. 36 " 39 The interaction distances between HN b and aromatic 
rings are in the range 4.3—4.5 A (Fig. 4), which is a typical distance 
for Ar-HN b interactions. 36 " 39 The Ar-HN b interaction is a weak 
polar interaction with an interaction energy of approximately 1.5 
Kcal/mol, and usually multiple Ar-HN b interactions are needed to 
stabilize the ternary local structure of a protein. 39 Consistent with 
these general features, there are four parallel Ar-HN b interactions 
between (3-hairpin of Htt36Q3H-Exl and aromatic residues of 
MBP (Fig. 4). 

Discussion 

There are a number of conclusions that can be reached from the 
analysis of the Httl7Q-EXl 2s and Htt36Q3H-EXl (present 
study) structures. Both structures indicate that most N-terminal 
fragment of Htt protein (N17 region) exists as a-helix. This con- 
clusion is also in agreement with previous computational and 
biochemical analysis of Htt-N17 region, 40,41 but not with the 
solution NMR studies. 42 However, recent magic-angle-spinning 
solid-state NMR studies indicated that Htt-N17 domain is in 
a-helical conformation, in agreement with our results. 20 Both 
crystal structures further indicate that polyQ region can also exist 
in a-helical configuration, in agreement with CD spectra analy- 
sis 25,26 ' 43 and modeling results. 44 " 46 However, in both structures 
the polyQ region was "conformationally unstable," indicating 



that polyQ ct-helix could easily unfold into a loop. A very shal- 
low equilibrium between a-helical and loop conformations of 
Htt-polyQ is in agreement with various conformations reported 
for the same region by molecular dynamics stimulations, by bio- 
physical measurements and by epitope mapping. 15,20,22 " 27,43,46 " 49 In 
the Httl7Q-Exl structure helix-to-loop transition occurred at 
random positions within polyl7Q region. 28 In Htt36Q3H-EXl 
structure helix-to-loop transition was facilitated by insertion of 
three histidine residues after Q394 within polyQ stretch. Thus 
we conjectured that the "synchronized" transition from a-helix 
to loop at H395 in most Htt36Q3H-Exl molecules increased dif- 
fraction quality of the Htt36Q3H-EXl crystals. Consistent with 
this hypothesis, we could not obtain well diffracted crystals for 
Htt36Q-EXl protein without His insertion. 

In some Htt36Q3H-Exl molecules the polyQ region main- 
tained an a-helical structure after His395 (Fig. 2A) and in other 
molecules remaining portion of polyQ region adopted a loop 
conformation (Fig. 2B). Both a-helical and loop conformations 
of Htt-polyQ were also observed in the Httl7Q-Exl structure. 28 
The a-helical and loop conformations of Htt-polyQ were located 
in the solvent region and were not involved in any protein-protein 
interactions in both Htt36Q3H-Exl and Httl7Q-Exl structures. 
Thus, we confirmed our previous conclusions 28 that the a-helical 
and loop conformations of polyQ region are likely to be "inert" 
and non-pathogenic. 

When compared with the previous analysis of Httl7Q-EXl 
structure, 28 the main new discovery in the present study is the 
ability of expanded polyQ region in Htt36Q3H-EXl to adopt 
(3-hairpin conformations. The (3-strand structures observed in 
our crystals for Htt36Q3H were stabilized by an intramolecu- 
lar hydrogen double bond (Fig. 3). It is likely that the hairpin 
structures constitute a nucleation site for the anti-parallel (3-sheet 
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formed by expanded polyQ tracks. Such (3-sheet structure was 
predicted for expanded polyQ based on biophysical and com- 
putational analysis. 12 " 21,26,46 We propose that the transition of 
Htt36Q3H from ot-helix (Fig. 2A) to loop (Fig. 2B) to (3-hairpin 
(Fig. 2C) captured in our crystals provides a plausible sequential 
conformational transition from a-helical to (3-sheet conformation 
of expanded polyQ region in mHtt. These transitions have not 
been previously observed for Htt or for any other protein using 
near atomic resolution techniques. 

In contrast to "inert" a-helical and loop conformations of 
polyQ, we discovered that all hairpin structures of Htt-polyQ 
were stabilized by additional interactions with aromatic residues 
in the MBP protein (Fig. 4). It is well established that the pres- 
ence of aromatic residues in a target protein facilitates interactions 
with expanded polyQ proteins, such as in the case of QBP1 pep- 
tide 50 or HQP09 peptoid. 51 Our results indicate that these inter- 
actions most likely mediated by Ar-HN b interactions between a 
backbone of Htt-polyQ exposed in the (3-strand structure and 
the aromatic residues in the mHtt ligands (Fig. 4). We uncovered 
similar Ar-HN b interactions in the recently reported crystal struc- 
ture of extended conformation of GQ 0 G peptide complexed with 
the polyQ-specific antibody. 29 The potential interactions in this 
structure (pdb structure file 2GTW) are between backbones of 
Q4 and Q7 in the GQ 0 G peptide and Y102 and Y31 residues in 
the antibody, with the measured distance of 4.0 A for both inter- 
action pairs (data not shown). Similar interactions are likely to 
be involved in association of mutant Htt with Hsp70 and Hsp40 
chaperons. 52 We suggest that the presence of the MBP aromatic 
residues at positions optimal for formation Ar-HN b contacts with 
HQHQH (3-hairpin residues may have facilitated crystallization 
of MBP-Htt36Q3H-EXl. This may explain why insertion of 
HQHQH in other positions within MBP-Htt36Q-EXl has not 
resulted in the diffracting crystals (Table SI). We reasoned that 
association between HN b backbone groups in polyQ (3-strands 
and aromatic residues in biologically unrelated proteins (such as 
MBP in our crystals) is a mimic for pathogenic interactions of 
mHtt in cells. We further propose that the observed (3-hairpin 
structure constitutes a seed for (3 -sheet conformation of expanded 
polyQ in a mHtt. This conclusion is in agreement with the recent 
biochemical studies, 19 solid-state NMR analysis 20 and computa- 
tional modeling studies. 46 

One potential limitation of the approach used in our stud- 
ies is that HQHQH sequence was inserted into Htt36Q region. 
Interruption of polyQ track in Ataxin-1 by His insertion is known 
to affect pathogenesis of SCA1. 30 " 32 Moreover, it was directly dem- 
onstrated that centrally inserted HQH sequence slows down the 
rates of Q30 aggregation in vitro. 33 However, biophysical and mod- 
eling studies suggest that insertion of His residues within polyQ 
track does not result in major alterations in the polyQ structure. 
Indeed, it has been demonstrated that K 2 Q 5 HQHQ 5 K, pep- 
tide is able to seed formation of K 2 Q 3() K 2 aggregates as well as 
K 2 Q 30 K 2 peptide itself. 33 These results suggest that insertion of 
HQH sequence slowed down aggregation of K 2 Q 5 HQHQ 5 K 2 
peptides but did not altered properties of resulting aggregates. 
The (3-hairpin structure observed in our crystals (Figs. 2C and 
3) is in complete agreement with intramolecular (3-sheet structure 



proposed for HQH peptides based on spectroscopic studies. 34,35 As 
discussed by these authors, the pure polyQ peptides have tendency 
to form large intermolecular (3-sheet structures through extended 
hydrogen bonding and thus tend to aggregate quickly. 34,35 In con- 
trast, polyQ peptides with His interruptions demonstrate a greater 
propensity for intramolecular hydrogen bonding and greater solu- 
bility. 34,35 Most likely these physical properties of His-interrupted 
polyQ peptides facilitated formation of well diffracted MBP- 
Htt36Q3H-EXl crystals and enabled us to solve the structure of 
(3-hairpin at high resolution (Figs. 2C and 3). 

Materials and Methods 

Cloning, protein expression, purification, crystallization and 
X-ray diffraction data collection. The first exon (EX1) fragment 
of human Htt containing polyQ tracks of various length (17Q, 
23Q, 36Q and 45Q) were synthesized by GenScript. XQXQX 
interrupters (X = G, A, H) were introduced after 7th, 9th, 10th, 
12th and 15th glutamine residue within a polyQ track (Table SI). 
The 19 aa tag (QSY QIT AGK LGT GRR FTT S) was added at 
C-terminal of each construct as previously described. 28 Generated 
constructs were cloned into NotI and PstI sites of the pMAL vec- 
tor with modified 3Ala-linker. 58 The amino-acid (aa) numbering 
in MBP-HttPolyQ-EXl construct starts with MBP, so that Metl 
in the Htt corresponds to Met371 in the MBP-HttPolyQ-EXl. 
The MBP-HttPolyQ-EXl proteins were expressed and purified 
according to the procedures described in our previous paper. 28 
Briefly, the MBP-HttPolyQ-EXl protein was expressed in BL21 
bacteria and purified on Maltose-binding affinity column, fol- 
lowed by molecular exclusion chromatography (Superdex200 
from Pharmacia Biotech) using FPLC. The purified MBP- 
HttPolyQ-EXl proteins were crystallized in a hanging drop 
without removing MBP or C-terminal tag in order to improve 
protein solubility and to facilitate crystallization. Among all 
constructs tested the best diffracting crystals were obtained for 
MBP-Htt36Q3H-EXl protein with HQHQH insertion after 
7th glutamine residue (Table SI). For MBP-Htt36Q3H-EXl 
proteins single crystals in size of 50/50/200 ixm grew in the 
mother drop containing 15% polyethylene glycol MW12K, 
100 mM Zn Acetate, 200 mM sodium floride, 100 mM sodium 
cacodylate pH = 6.5 to 7.4 at 4 to 16°C in 6 to 12 weeks. To 
check protein integrity, MBP-Htt36Q3H-EXl crystals have been 
harvested, washed three times in the crystal-mother-solution buf- 
fer and analyzed by SDS-PAGE gel electrophoresis. The crystals 
with mother liquid in a loop were directly frozen at liquid nitro- 
gen temperature. X-ray diffraction data from the frozen crystals 
were collected at APS 19ID and ALS B18.2.2. The diffraction 
data collected from several MBP-Htt36Q3H-Exl crystals were 
indexed and processed by HKL2000 and two of best diffracted 
data sets (crystals XI and X2) were selected to solve the MBP- 
Htt36Q3H-Exl structures. 

Phasing by molecular replacement. All crystals of MBP- 
Htt36Q3H-EXl had monoclinic symmetry, space group C2, 
with three molecules of MBP-Htt36Q3H-EXl in the asymmet- 
ric unit (ASU). For each crystal, phasing was done by molecular 
replacement (MR) using the Phaser program. Forty MBP pdb 
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files were downloaded from the protein databank and different 
permutations of 3 MBP models were used as possible models for 
three proteins in the ASU. The best solutions for both data sets 
(XI and X2 crystals) from MR consisted of a trimer of ILLS 
models for MBP. The two crystals were nearly isomorphous but 
insufficient for averaging them. Thus, the structure determination 
was performed separately for each crystal. The electron density 
maps generated by MR consisted of MBP map connected to the 
electron density representing Htt36Q3H-EXl. The part of the 
electron density corresponding to MBP protein and N17 region 
of Huntingtin were resolved at 2.8 A. The part of the map cor- 
responding to polyQ region of Htt36Q3H was less clear, indi- 
cating that this part of the protein adopts multiple conformation 
in the crystals. This part of the map was improved in the pro- 
cess of iterative model building with Coot and refinement with 
Refmac. However, only backbone structure could be resolved for 
this region, forcing us to build the models initially just for the 
backbone of the protein. The side-chains were incorporated into 
the backbone model by using Ramanchadran plot and side-chain 
polarity information. In order to validate the resulting model we 
calculated Omit map using CCP4-6.2 (Fig. SI) and determined 
map-model correlation coefficient (Table S2). Table 1 summaries 
the properties of the two crystals and corresponding diffraction 
data sets used in solving Htt36Q3H-EXl structure. 

Model building and refinement. When the two maps gener- 
ated by MR for XI and X2 crystals were compared, it became 
apparent that the Htt36Q3H-EXl fragments adopt slightly dif- 
ferent configuration in the two crystals. To account for this fact, 
the models of Htt36Q3H-EXl fragments in the two crystals were 
built separately but simultaneously. As these models were built, 
it became apparent that the molecule C in each trimer existed 
in a double conformation (CI and C2). To account for a double 
conformation of the C molecule, the CI and C2 models were built 
separately and refined with the occupancy in the range between 

0. 70.0.60 for each model to yield the best K^ otk and R Free factors 
during refinement using refmac5. In the process of model building 
and refinement the R^^ and R Free were improved and the electron 
density map of each Htt36Q3H-EXl became more clear. Table 1 
summarizes the final refinement statistics for the two crystals with 
CI and C2 conformations used for Htt36Q3H-EXl structure 
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